RAW SEQUENCE LISTING 
ERROR REPORT 

*• A 

The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 10 loSS, 2^ 2. 

Source: TtTlO 

Date Processed by STIC: Z\ zs\ Q?> ^ ^ 

. in. 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMlSSIOPfcQ4JESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703- 308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTTN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://wvnv.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htin> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 th Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 
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Raw Sequence Listing Error Summary 



ERROR DETECTED 



SUGGESTED CORRECTION 



SERIAL NUMBER: 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA- HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

• *" * 

1 Wrapped Nucleics The numberAcxt at the end of each line "wrapped** down to the next line. This fc&y occur ifyour file 

Wrapped Aminos was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent -wrapping,** 

2 Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

3 Misaligned Amino The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 

Numbering use space characters, instead. 



_Non- ASCII The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 

ensure your subsequent submission is saved in ASCII text 

_Variable Length ' 1 r 1 4 Sequencc(s) contain n*s or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 



Patentln 2.0 
"bug** 



_Skippcd Sequences 
(OLD RULES) 



_Skipped Sequences 
(NEW RULES) 



A "bug** in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknqy sequences. 

$equence(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X**,is shoWn) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



Sequencers) _ 



<210> sequence id number 
<400> sequence id number 
000 



missing. If Intentional, please insert the following lines for each skipped sequence. 



_Use of n's or Xaa*s 
(NEW RULES) 



10 \/ Invalid <213> 
Response 



II 



Useof<220> 



12 



JPatentln 2.0 
"bug** 



Use of n*s and/or Xaa*s have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa*s are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

— /■ 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Sequences) _ 



_ missing the <220> "Feature** and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <213> "Organism** response is "Artificial Sequence** or 
"Unknown.** Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register,** 06701/1998, Vol. 63, No. 104, pp. 29631-32) (Sec, 1.823 of Sequence Rules) 

Please do not use "Copy to Disk** function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager** or any other manual means to copy file to floppy disk. 



13 Misuse of n n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide 



AMC/MH - Biotechnology Systems Branch - 08/21/2001 
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RAW SEQUENCE LISTING DATE: 02/25/2003 

PATENT APPLICATION: US/10/089,292 TIME: 12:11:36 

Input Set : A:\Serial No. 10-089,292 Sequence Listing 
Output Set: N:\CRF4\02252003\J089292.raw 

3 <110> APPLICANT: NG, Mun Hon 

4 IM, Stanley 

5 ZHANG, Ji-Zhong 

7 <120> TITLE OF INVENTION: NOVEL HEV ANTIGENIC PEPTIDE AND METHODS 
9 <130> FILE REFERENCE: IEC010036US 
C — > 11 <140> CURRENT APPLICATION NUMBER: US/10/089 , 292 

12 <141> CURRENT FILING DATE: 2000-09-28 

14 <160> NUMBER OF SEQ ID NOS: 3 

16 <170> SOFTWARE: Patent In version 3.1 

18 <210> SEQ ID NO: 1 

19 <211> LENGTH: 642 

20 <212> TYPE: DNA 

21 <213> ORGANISM: Hepatitis E virus 

23 <4 00> SEQUENCE: 1 

24 cagctgttct actctcgtcc cgtcgtctca gccaatggcg agccgactgt taagctttat 60 
26 acatctgtag agaatgctca gcaggataag ggtattgcaa tcccgcatga catcgacctc 120 
28 ggggagtctc gtgtagttat tcaggattat gacaaccaac atgagcagga ccgaccgaca 180 
30 ccttccccag ccccatcgcg ccctttttct gtcctccgag ctaatgatgt gctttggctt 240 
32 tctctcaccg ctgccgagta tgaccagtcc acttacggct cttcgaccgg cccagtctat 300 
34 gtctctgact ctgtgacctt ggttaatgtt gcgaccggcg cgcaggccgt tgcccggtca 360 
36 ctcgactgga ccaaggtcac acttgatggt cgcccccttt ccaccatcca gcagtattca 420 
38 aagaccttct ttgtcctgcc gctccgcggt aagctctcct tttgggaggc aggtactact 480 
40 aaagccgggt acccttataa ttataacacc actgctagtg accaactgct cgttgagaat 540 
42 gccgctgggc atcgggttgc tatttccact tacaccacta gcctgggtgc tggtcccgtc 600 
44 tctatttccg cggttgctgt tttagccccc cctccgcgct ag 642 
4 7 <210> SEQ ID NO: 2 

48 <211> LENGTH: 213 

49 <212> TYPE: PRT 

50 <213> ORGANISM: HEV 
52 <4 00> SEQUENCE: 2 



54 


Gin 


Leu 


Phe 


Tyr 


Ser 


Arg 


Pro 


Val 


Val 


Ser 


Ala 


Asn 


Gly 


Glu 


Pro 


Thr 


55 


1 








5 










10 










15 




58 


Val 


Lys 


Leu 


Tyr 


Thr 


Ser 


Val 


Glu 


Asn 


Ala 


Gin 


Gin 


Asp 


Lys 


Gly 


He 


59 








20 










25 










30 






62 


Ala 


He 


Pro 


His 


Asp 


He 


Asp 


Leu 


Gly 


Glu 


Ser 


Arg 


Val 


Val 


He 


Gin 


63 






35 










40 










45 








66 


Asp 


Tyr 


Asp 


Asn 


Gin 


His 


Glu 


Gin 


Asp 


Arg 


Pro 


Thr 


Pro 


Ser 


Pro 


Ala 


67 
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55 










60 










70 


Pro 


Ser 


Arg 


Pro 


Phe 


Ser 


Val 


Leu 


Arg 


Ala 


Asn 


Asp 


Val 


Leu 


Trp 


Leu 


71 


65 










70 










75 










80 


74 


Ser 


Leu 


Thr 


Ala 


Ala 


Glu 


Tyr 


Asp 


Gin 


Ser 


Thr 


Tyr 


Gly 


Ser 


Ser 


Thr 


75 
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90 










95 




78 


Gly 


Pro 


Val 
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Val 


Ser 
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Ser 


Val 


Thr 


Leu 


Val 
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Val 


Ala 


Thr 
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RAW SEQUENCE LISTING DATE: 02/25/2003 

PATENT APPLICATION: US/10/089 , 292 TIME: 12:11:36 



Input Set : A:\Serial No. 10-089,292 Sequence Listing 
Output Set: N:\CRF4\02252003\J089292.raw 
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82 


Gly 


Ala Gin 


Ala 


Val 


Ala 


Arg 


Ser 


Leu 


Asp 


Trp 


Thr 
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Val 


Thr 


Leu 


83 




115 










120 










125 








86 


Asp 


Gly Arg 


Pro 


Leu 


Ser 


Thr 


He 


Gin 


Gin 


Tyr 


Ser 


Lys 


Thr 


Phe 


Phe 


87 
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135 










140 










90 


Val 


Leu Pro 


Leu 


Arg 


Gly 


Lys 


Leu 


Ser 


Phe 


Trp 


Glu 


Ala 


Gly 


Thr 


Thr 


91 
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94 
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Ala Gly 
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Pro 
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Tyr 
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Thr 
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Leu 


Val Glu 


Asn 


Ala 


Ala 


Gly 


His 


Arg 


Val 


Ala 


He 


Ser 


Thr 


Tyr 


Thr 


99 






180 










185 










190 







102 Thr Ser Leu Gly Ala Gly Pro Val Ser He Ser Ala Val Ala Val Leu ■ 

103 195 200 205 

106 Ala Pro Pro Pro Arg 

107 210 

110 <210> SEQ ID NO: 3 

111 <211> LENGTH: 34 

112 <212> TYPE: DNA f/YLfcUol V€A pCJ^ff _ gjtfL Q/W^ <5Uwr 

113 <213> ORGANISM :Q)NA (genomic)) ^ <A h W in 

115 <400> SEQUENCE: 3 Rem ,U 

116 ggcgaatccc tagcgcggag ggggggctaa aaca 34 



file://C:\CRF4\Outhold\VsrJ089292.htm 



VERIFICATION SUMMARY DATE: 02/25/2003 

PATENT APPLICATION: US/10/089,292 TIME: 12:11:37 

Input Set : A:\Serial No. 10-089,292 Sequence Listing 
Output Set: N:\CRF4\02252003\J089292.raw 

L:ll M:270 C: Current Application Number differs, Replaced Current Application Number 
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